Backwards compatible extended image format

ABSTRACT

Techniques are provided for encoding an extended image such that it is backwards compatible with existing decoding devices. An extended image format is defined such that the extended image format is consistent with an existing image format over the full range of the existing image format. Because the extended image format is consistent with the existing image format over the full range of the existing image format, additional image information that is included in an extended image can be extracted from the extended image. A base version of an image (expressed using the existing image format) may be encoded in a payload portion and the extracted additional information may be stored in a metadata portion of a widely supported image file format.

BACKGROUND

This disclosure relates generally to an image encoding system and methodthat provides increased precision, increased dynamic range, and a widercolor gamut as compared to many existing image file formats. Moreparticularly, this disclosure relates to an image encoding method thatis backwards compatible with existing devices such that the increasedprecision, dynamic range, and color gamut data does not cause existingdevices to fail.

As is known, digital images are expressed in terms of reference valuesthat define the properties of the image. For example, properties foreach pixel of a digital image may be specified by multiple referencevalues (e.g., R or red, G or green, and B or blue values). Thesereference values are defined in terms of a color model. A color modeldescribes the way that colors can be represented using combinations ofreference values. The set of colors that can be produced according to aparticular color model is a color space. The most common color model forproducing images on display devices such as television screens, computermonitors, tablets, etc. is the RGB color model. The RGB color modeldefines a set of colors that are produced from combinations of varyinglevels (i.e., varying reference values) of red, green, and blue primarycolors.

The CIE 1931 color space chromaticity diagram is illustrated in FIG. 1.Outer curved boundary 105 represents the visible spectrum monochromaticcolors with wavelengths indicated in nanometers. The colors along outercurved boundary 105 progress through a range of purple, blue, green,yellow, orange, and red with increasing wavelength. The chromaticitiesof the red, green, and blue color primaries for a particular RGB colorspace (i.e., the chromaticity where one color channel has a nonzerovalue and the other two channels have zero values) form the vertices ofcolor triangle 115. The gamut of chromaticities that can be representedby the RGB color space are represented by the chromaticities that arewithin color triangle 115. Color triangle 115 corresponds to the sRGBcolor space, the most common of the RGB color spaces. Vertex 110A is thesRGB red primary, vertex 1108 is the sRGB green primary, and vertex 110Cis the sRGB blue primary. The D65 white point, the point at which all ofthe color channels are equal to one, is illustrated at 120.

As indicated in FIG. 1, typical color spaces such as the sRGB colorspace encompass substantially less than the full range of chromaticitiesthat are visible to humans. In addition, typical color spaces arecapable of representing only a small portion of the brightness levelsthat can be perceived by humans. These color space limitations have beenincorporated into commonly used color spaces by design based on thecolors that display media are capable of producing. That is, colorspaces need only encompass the colors that can be produced by existingdisplay media such as television displays and computer monitors. Infact, the precision with which colors can be produced (for a given datasize) is increased where the color space is limited to only those colorsthat can be produced. With the advent of new display technologies thatare capable of producing a wider color gamut (i.e., a wider range ofchromaticities) and increased dynamic range (i.e., a wider range ofbrightness levels), it will be necessary to define images in terms ofcolor spaces that include a wider range of colors. However, during thistransition, it will also be necessary that image files carrying theadditional color information can also be read and rendered by existingdevices. It would therefore be desirable to specify a method and systemfor encoding images that provides increased precision, increased dynamicrange, and a wider color gamut as compared to existing image fileformats and that is also compatible with existing display devices.

SUMMARY

A method of encoding an image having extended image content may includeobtaining a first image expressed in a first image format and obtaininga second image that corresponds to the first image and is expressed in asecond image format. Each element of the first image may be defined byreference values in a first range and each element of the second imagemay be defined by reference values in a second range. The first rangemay be a proper subset of the second range such that the first formatand the second format are consistent over the complete range ofreference values for the first format. In one embodiment, the firstimage may then be subtracted from the second image to obtain a deltaimage. The first image may be encoded in the standard payload portion ofan image file and the delta image may be encoded in a metadata portionof the image file. The method may be embodied in program code and storedon a non-transitory medium. The stored program code may be executed byone or more processors that are part of, or control, a system that isconfigured to implement the method.

A method of decoding an image having extended image content may includedecoding a payload portion of the image file to generate a first image.The first image may be expressed in a base image format where each imageelement is defined by reference values in a first range. A metadataportion of the image file may be decoded to generate additional imagedata. The additional image data may be combined with the first image togenerate a second image. The second image may be expressed using anextended image format where each image element is defined by referencevalues in a second range. The first range may be a proper subset of thesecond range such that the base image format and the extended imageformat are consistent over the complete range of reference values forthe base image format. The method may be embodied in program code andstored on a non-transitory medium. The stored program code may beexecuted by one or more processors that are part of, or control, asystem that is configured to implement the method.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates the CIE 1931 color space chromaticity diagram withthe sRGB color gamut specified.

FIG. 2 is a block diagram that illustrates the components of an extendedimage format in accordance with one embodiment.

FIG. 3 is a block diagram that illustrates an operation to extract theadditional information encoded in an extended image format in accordancewith one embodiment.

FIG. 4 is a block diagram that illustrates an operation to extract theadditional information encoded in an extended image format and toseparate the extracted information into different channels in accordancewith one embodiment.

FIGS. 5A through 5C are block diagrams that illustrate processes toencode a base image and extracted image data from an extended image intoa widely supported image format in accordance with multiple embodiments.

FIG. 6 is a block diagram that illustrates the encoding of a base image,one or more channels of extracted additional image data, andinstructions for reconstructing an extended image in a widely supportedimage format in accordance with one embodiment.

FIG. 7 is a flow chart that illustrates a process for decoding anextended image that is encoded using a widely supported image format inaccordance with one embodiment.

FIG. 8 shows an illustrative electronic device in accordance with oneembodiment.

DETAILED DESCRIPTION

This disclosure pertains to systems, methods, and computer readablemedia for encoding an extended image such that it is backwardscompatible with existing decoding devices. In general, an extended imageformat is defined that uses channel primaries that match an existingimage format. Because the extended image format references the channelprimaries of an existing image format, additional image information thatis included in the extended image format can be extracted throughcomputationally inexpensive operations as described below.

In the following description, for purposes of explanation, numerousspecific details are set forth in order to provide a thoroughunderstanding of the inventive concept. As part of this description,some of this disclosure's drawings represent structures and devices inblock diagram form in order to avoid obscuring the invention. In theinterest of clarity, not all features of an actual implementation aredescribed in this specification. Moreover, the language used in thisdisclosure has been principally selected for readability andinstructional purposes, and may not have been selected to delineate orcircumscribe the inventive subject matter, resort to the claims beingnecessary to determine such inventive subject matter. Reference in thisdisclosure to “one embodiment” or to “an embodiment” means that aparticular feature, structure, or characteristic described in connectionwith the embodiment is included in at least one embodiment of theinvention, and multiple references to “one embodiment” or “anembodiment” should not be understood as necessarily all referring to thesame embodiment.

It will be appreciated that in the development of any actualimplementation (as in any development project), numerous decisions mustbe made to achieve the developers' specific goals (e.g., compliance withsystem- and business-related constraints), and that these goals willvary from one implementation to another. It will also be appreciatedthat such development efforts might be complex and time-consuming, butwould nevertheless be a routine undertaking for those of ordinary skillin the art of image processing having the benefit of this disclosure.

Referring to FIG. 2, a base image format may describe an image in termsof reference values 205 that define the properties of image pixels. Forexample, each pixel of an image expressed in an RGB format may includereference values for a red channel, a green channel, and a blue channel.Collectively, the reference values for the red, green, and blue channelsdefine the properties of the pixel within a given color space (i.e., acolor space defined by the red, green, and blue channel primaries). Thereference values may be described in terms of nominal values that rangefrom 0.0 to 1.0. For example, an image pixel having a value of(1.0,0.0,0.0) (expressed as (R, G, B)), would be a pure red pixel withthe highest possible brightness (i.e., a pixel having the properties ofthe red channel primary). In the illustrated embodiment, as is commonfor popular consumer image formats, each reference value can beexpressed as an 8 bit binary number. One of ordinary skill in the artwill recognize that other bit depths are possible. For example, 16 bit.

In one embodiment, an extended image format is consistent with the baseimage format over the range of reference values of the base imageformat. Therefore, the extended image format references the sameprimaries as the base image format (e.g., Red, Green, and Blue).However, the nominal range for reference values 210 may be extended toencode additional image data (i.e., image data that cannot berepresented using the base format such as, for example, gamut, dynamicrange, and precision). That is, the range of reference values of thebase image format is a proper subset of the range of reference values ofthe extended image format. Because the extended image format isconsistent with the base image format over the range of reference valuesfor the base image format, reference values within this range (i.e.,nominal values of 0.0 to 1.0 in the illustrated embodiment) representthe same color properties in either the base image format or theextended image format. In the illustrated embodiment, the nominal rangefor each reference value 210 in the extended image format is doubled ascompared to the base image format reference value 205 (from 0.0-1.0 to−0.75-1.25). This extended nominal range may be utilized to encodeincreased brightness and a wider color gamut than can be produced usingthe base image format. In one embodiment, negative values may beutilized to encode colors that are outside of the base image formatgamut (i.e., outside of the color space of the base image format). Forexample, a value of (1.0, −0.75, −0.75) may represent a very saturatedred that cannot be represented in the base image format. Over unityvalues may be utilized to encode increased brightness that cannot berepresented using the base image format (i.e., outside of the dynamicrange of the base image format). For example, (1.25, 0.0, 0.0) mayrepresent a much brighter version of the red primary. In addition, inthe illustrated embodiment, reference values for the extended imageformat are expressed using 10 bit binary numbers. Accordingly, in thisembodiment, one additional bit may be utilized to double the nominalrange as compared to the base image format and another additional bit todouble the precision throughout the increased nominal range. Oneadvantage of this approach is that when combined with non-linear gamma,the effective linear range of the representation is increased. By way ofexample, a 2.2 gamma value in a 0.0 to 1.0 representation is still 0.0to 1.0 linear. In contrast, a 2.2 gamma value in a 0.0 to 1.25representation is actually 1.6 times the range provided by the 0.0 to1.0 linear range.

It should be noted that the illustrated embodiment is provided as anexample only. It is not necessary that the additional image informationof the extended format be distributed in the manner illustrated in FIG.2 (i.e., 75% of the additional range devoted to wider gamut and 25%devoted to increased dynamic range), that the reference values for thebase and extended formats be encoded using any particular number ofbits, or that the reference values correspond to an RGB color model.This disclosure is equally applicable to any extended image format(using any color model) that extends the range of reference values toencode image information that cannot be encoded using a base imageformat while maintaining consistency over the range of reference valuesof the base image format.

Referring to FIG. 3, in one embodiment and as will be described infurther detail below, it may be desirable to identify the differencesbetween an image expressed using the base image format and acorresponding image expressed using the extended image format. Raw image305 may include image sensor data that has either not been processed atall or has only been minimally processed. Raw image 305 may be analogousto a film photography negative in that each may include imageinformation that is not included in a final image format. For example,some of the information captured by an image sensor and included in rawimage 305 may represent brightness levels or colors that cannot beencoded in base image 310 or even extended image 315. As will beunderstood by those of ordinary skill in the art, the conversion of rawimage 305 to extended image 315 and base image 310 may involveoperations that include noise reduction, sharpening, demosaicing, blacksubtraction, highlight recovery, tone mapping etc., all of which areoutside of the scope of the present disclosure. The result of theseprocesses, however, will produce base image 310 that is comprised ofreference values for a base image format such as reference values 205and an extended image that is comprised of reference values for anextended image format such as reference values 210. Extended image 315may include image information from raw image 305 that cannot beexpressed in base image 310.

In addition, although the extended image format is consistent with thebase image format over the range of reference values for the base imageformat, the conversion process between raw image 305 and base andextended images 310 and 315 may result in different reference valueseven for colors that can be expressed within the base image formatrange. For example, if raw image 305 depicts a bright yellow object thatcannot be entirely reproduced within the base image format of base image310, rather than clamping the colors that extend outside of the baseimage color space at the maximum reference values, which results in aflat and unnatural appearance, the conversion process may adjust certaincolors that are within the base image color space such that a morenatural reproduction of the object is obtained. However, when convertingraw image 305 to extended image 315, which is capable of representing abroader color space, it may not be necessary to adjust the object'scolors in the same way as for the base image format. While FIG. 3illustrates the conversion of raw image 305 to both base image 310 andextended image 315, in another embodiment, raw image 305 may beconverted to extended image 315 and base image 310 may be generated fromextended image 315. After each of base image 310 and extended image 315have been generated, base image 310 may be subtracted from extendedimage 315 to extract the difference between the two images, expressed asdelta image 320.

Referring to FIG. 4, because extended image 315 is expressed in terms ofan extended format that is consistent with the base image format of baseimage 310 over the full range of reference values for base image 310,the extraction of delta image 320, which carries all of the additionalinformation contained in extended image 315 and not in base image 310,may be performed as a simple pixel by pixel difference operation. Forexample, reference values for pixel 310A of base image 310 may besubtracted from corresponding reference values for corresponding pixel315A of extended image 315. The resulting reference values define theproperties of pixel 320A of delta image 320. Using the extended formatdescribed above with respect to FIG. 2 as an example, the nominalreference values of pixel 315A may be (1.21, −0.4, −0.3), whichspecifies a bright, deep red color that is outside of the range of thebase image format of base image 310. The reference values ofcorresponding pixel 310A may be (0.98, 0, 0), which represents nearlythe brightest, deepest red that can be produced using the base imageformat. The numeric representations of the 10 bit binary expression ofthe reference values of pixel 315A and the 8 bit binary expression ofthe reference values of pixel 310A are (1003, 179, 230) and (250, 0, 0),respectively. This results in reference values of (753, 179, 230) forpixel 320A of delta image 320.

In one embodiment, it may be desirable to separate the delta image intomultiple delta image channels, each representing a particular feature.For example, in the illustrated embodiment, delta image 320 may beseparated into delta image channels 322, 324, and 326, which representadditional dynamic range, wider color gamut, and increased precision ofextended image 315 with respect to image 310, respectively. The numberand types of delta image channels generally depends on the differencesbetween the extended image format and the base image format. In order toseparate the portions of delta image 320 that are attributable to thesevarious characteristics, the reference values for each pixel of deltaimage 320 may be compared to the reference values for correspondingpixels of extended image 315. For example, referring again to theextended format described with respect to FIG. 2, the nominal valuesfrom −0.75 to 0.0 of the extended image format may be used to expresswider color gamut and the nominal values 1.0 to 1.25 used to expressincreased brightness. These nominal values correspond to numeric valuesof 0 to 393 and 896 to 1023 in the 10 bit expression of the extendedimage format of FIG. 2. Accordingly, the portions of reference valuesthat are within these ranges are attributable to the specifiedproperties. Portions of the reference values for pixel 315A that areattributable to increased brightness can be extracted to generatereference values of (107, 0, 0) for pixel 322A of increased brightnessdelta image channel 322. Similarly, portions of the reference values forpixel 315A that are attributable to wider color gamut can be extractedto generate reference values of (383, 179, 230) for pixel 324A of widegamut delta image channel 324. When these increased brightness and widergamut reference values are subtracted from the reference values of pixel320A of delta image 320, the resulting reference values (263, 0, 0)represent the increased precision between the base image format and theextended image format and the differences in the conversion process froma raw image to the extended image format and the base image format andcan be encoded as the reference values for pixel 326A of increasedprecision delta image channel 326. Although the embodiment illustratedin FIG. 4 depicts the separation of delta image 320 into three differentimages, more or fewer separate image channels may also be used. Inanother embodiment, multiple delta image channels may be encoded,optionally transmitted, and optionally decoded based on a desire tocontrol bandwidth/quality, and the needs of different decoders (i.e., adevice that doesn't support a gamut wider than the nominal range doesn'tneed the −0.75-1.0 channel transmitted to it. Similarly, a device thatsupports higher precision may subscribe to a delta image channel thatprovides that extra range). This approach to generating delta imagechannels is applicable to most existing file formats/imagerepresentations. The described delta image channels may be stored asexplicit metadata, stored implicitly by “stuffing” the data into thefile/stream where legal (perhaps after the end of the existing image inaccordance with the file format's definition), in a separate but relatedfile (“boxcar”), or file system fork.

Referring to FIG. 5A, in order to ensure that the extended image formatis compatible with existing devices, the base image may be compressedand encoded using a widely supported image file format (e.g., JPEG andTIFF). In addition, the delta image (which may be separated into one ormore delta image channels as described above) may be compressed andpackaged as metadata that accompanies the base image according to thestandards of the selected file format. In this way, devices that supportthe extended image format can generate the extended image from the baseimage and the delta image while devices that do not support the extendedimage format can simply disregard the delta image metadata and decodethe base image in the usual manner.

In the embodiment illustrated in FIG. 5A, base image 510A may besubtracted from extended image 515A to generate delta image 520A. Again,because the extended image format is consistent with the base imageformat over the range of reference values for the base image format, thesubtraction may be performed as a computationally inexpensive pixel bypixel difference operation as described above. Base image 510A may becompressed (530A) and encoded as the payload portion of image file 540Ain accordance with the standards of a widely supported image file format(e.g., the JPEG File Interchange Format (JFIF)). Delta image (which mayactually be split into several different image channels) may similarlybe compressed (530A) and encoded in a metadata portion of image file540A. By way of example, the JFIF format includes multiple metadatachannels. In one embodiment, one or more delta image channels may becompressed and stored in one or more of these metadata channels.Although the JFIF format has been described, additional widely supportedimage file formats may also enable similar operation. Because thecompression and encoding processes are dependent upon the selected filetype and are known by those of ordinary skill in the art, theseprocesses are not discussed in detail in this disclosure.

It has unexpectedly been found that delta image channel data is likelyto be significantly spatially coherent and, further, may be anunchanging DC level over large areas of the base image (where the rawsignal is fully representable by the base). As a consequence, while thedelta image channel may be efficiently encoded using the samecompression as the payload (i.e., base image), there are othertechniques that might further help such as a quadtree representationthat would only encode the sparse areas where the delta image channeldata is actually modulated. It may also be beneficial to break the deltaimage channels out into discrete channels (i.e., negative values, hyperunity, and extra precision) to best make use of the individual sparsechannels. By way of example only, it may be that specular highlightsrequire significant hyper unity values, and deeply saturated areasnegative values. With respect to encoding delta image channelinformation, while either lossy or lossless encoding techniques can beused, it may be advisable to use lossless compression. Where lossycompression is used, delta image channel data may be determined(computed) to include the effect of the lossy compression. This may beachieved, for example by compressing, and decompressing the base imagedata before performing the above-described subtraction, effectivelyencoding the compression error in a delta channel and thereby allowing ahigher quality image to be decoded than the base. It has been determinedthat this is an additional use for the delta channel (over gamma, range,and precision).

Referring to FIG. 5B, base image 510B may first be compressed (530B) andthen decompressed (535B) before the image resulting from thecompression/decompression is subtracted from extended image 515B. In thecase of a lossy compression algorithm, the compression/decompressionoperation ensures that the delta image is calculated from the version ofthe base image that will be used to reconstruct extended image 515Busing image file 540B. This reduces error that may occur during theextended image reconstruction process when using a different version ofthe base image than that which was used to generate the delta image. Thecompressed base image may be encoded as the payload portion of imagefile 540B in accordance with the standards of a widely supported imagefile format. Similarly, delta image 520B may be compressed and encodedin a metadata portion of image file 540B.

In some embodiments, it may be necessary to convert the base image anddelta image to a different color model based on the selected fileformat. For example, if the selected file format for packaging the baseimage and the delta image is JFIF and the base image is encoded using anRGB color model (with the extended image being expressed as an extensionof the same RGB color model), the base image and the delta image mayneed to be converted to the Y′C_(b)C_(r) color model supported by JFIF.Until this point, this conversion process has been assumed to be part ofthe compression and encoding of the base image and the delta image.However, in certain embodiments, it may be advantageous to convert thecolor model of the extended image and the base image and to perform thedifference operation (between the extended image and the base image) inthe converted color space. If the payload portion of the selected fileformat is capable of encoding a wider range of colors than the colorspace of the base image (as is the case with the Y′C_(b)C_(r) colorspace of the JPEG standard as compared to the sRGB color space), it maybe desirable to include at least a portion of the delta image in thepayload portion of the image file rather than the metadata portion.Values outside of the 0.0-1.0 RGB unit cube may be represented in theY′C_(b)C_(r) 0-1 unit volume. Common Y′C_(b)C_(r) to R′G′B′ converterstypically clamp R′G′B′ values to 0.0-1.0 so it may be “safe” to encodeextended range R′G′B′ values in Y′C_(b)C_(r) to maintain backwardcompatibility.

Referring to FIG. 5C, base image 510C and extended image 515C may beconverted from a first color model to a second color model (550). In oneembodiment, the second color model may be a required color model forencoding an image using a selected file format (i.e., the file format ofimage file 540C). By way of example, if base image 510C and extendedimage 515C are encoded using the base image format and the extendedimage format described above with respect to FIG. 2 and the selectedfile format requires images to be encoded using the Y′C_(b)C_(r) colormodel, base image 510C and extended image 515C are converted from theRGB color model to the JPEG Y′C_(b)C_(r) color model. In one embodiment,the following equations are used to convert the nominal RGB referencevalues:Y′=0.299(R)+0.587(G)+0.114(B)  EQ. (1)C _(b)=−0.169(R)−0.331(G)+0.5(B)+0.5  EQ. (2)C _(r)=0.5(R)−0.419(G)−0.081(B)+0.5  EQ. (3)

The example RGB reference values for pixel 315A of extended image 315(1.21, −0.4, −0.3) and pixel 310A of base image 310 (0.98, 0, 0)described above with respect to FIG. 4 result in converted values of(0.093, 0.278, 1.297) and (0.293, 0.334, 0.99), respectively. After thecolor model conversion, converted base image 512C is subtracted fromconverted extended image 517C to generate delta image 520C. Delta image520C is expressed in the converted color space. A portion of delta image520C may be extracted and added back into converted base image 512C togenerate adjusted base image 514C. For example, the reference values ofconverted base image 512C may be adjusted towards the reference valuesof converted extended image 517C by adding a portion of delta image 520Cto the extent that the resulting reference values of adjusted base image514C are within an acceptable range. Referring again to the aboveexample reference values for corresponding pixels of converted baseimage 512C and converted extended image 517C, the reference values for acorresponding pixel of delta image 520C may be (−0.2, −0.056, 0.307).Some or all of these reference values may be added back to thecorresponding values of converted base image 514C to the extent that theresulting values are within an acceptable range of nominal referencevalues (e.g., 0 to 1). In one embodiment, this may result in referencevalues for a corresponding pixel of adjusted base image 514C of (0.093,0.278, 1) and remaining values for the corresponding pixel of deltaimage 520C of (0, 0, 0.297). Adjusted base image 514C and the remainingportions of delta image 520C (i.e., the portions that were notextracted) may then be compressed (530C) and encoded into a payloadportion and a metadata portion, respectively, of image file 540C inaccordance with the standards of the image file format of file 540C.

Referring to FIG. 6, in one embodiment, the base image and the deltaimage may be compressed and encoded in parallel, perhaps using multipleencoding/decoding units. Note, the “additional” encoding units don'thave to be identical. That is, there may be an opportunity to use adifferent hardware encoder for the delta channel information (than thebase) and thereby permit hardware acceleration for a different codecmaking the resulting device more versatile. Alternately the deltachannel information could be encoded/decoded in parallel to the baseimage in a central processing unit (CPU) based codec with the basehardware accelerated. In another implementation, the encoding of boththe delta and base images may be CPU-based (e.g., using different coresof a multicore processing unit). In the illustrated embodiment, thedelta image has been separated into multiple delta image channels 620A-Cas described above. Rather than compressing and saving base image 610and each of the delta image channels 620A-C, which would require acompress operation, a write operation, and a read operation to encodeeach of the images into image file 640, base image 610 and delta imagechannels 620A-C may be compressed in parallel and encoded in image file640. Base image 610 may be encoded in payload portion 645 of image file640 and each of delta image channels 620A-C may be encoded in a separatemetadata channel 650A-C of image file 640. Although three delta imagechannels are illustrated in FIG. 6, more or fewer delta image channelsmay be used.

In one embodiment, a delta image channel may encode the differencebetween a first compressed version of base image 610 and a secondcompressed version of delta image 610. For example, base image 610 maybe compressed in accordance with the standards of the image file format(i.e., the format of image file 640) and may also be compressed inaccordance with an improved compression algorithm. In such anembodiment, the difference between the two compressed versions may becompressed and encoded as one of the delta image channels 620A-C.

In one embodiment, the separation of delta image channels 620A-C intoseparate metadata channels of image file 640 may enable the selectivetransmission or usage of the delta image channels. In one embodiment, ifit is determined that a recipient device is incapable of utilizing oneor more of the delta image channels, only those channels that are usefulmay be transmitted. For example, if a recipient device is capable ofusing the precision delta image channel to increase the precision of thedisplayed image represented by image file 640 but is incapable ofutilizing the increased dynamic range, wide gamut, or compressiondifference channels, the delta image channels that correspond to theincreased dynamic range, wide gamut, and compression difference may beextracted before image file 640 is transmitted. Likewise, if atransmission medium has limited bandwidth, some or all of the deltaimage channels may be extracted prior to transmission of image file 640.Recognition of downstream decoder capabilities can permit thetransmitting station to manage bit-rate and deal with networkcongestion. Similarly, the receiving decoder may selectively decodedeltas based on known circumstances (e.g., it may choose to not decodenegative delta values when a wide gamut display is not available).

In the illustrated embodiment, image file 640 includes identifier 655and instructions 660 that are each stored in separate metadata channelsof image file 640. Identifier 655 may link originally encoded base image610 to the delta image channels. This linkage may be used to avoid theapplication of delta image data to an altered version of base image 610,which could be catastrophic. For example, if image file 640 is modified(e.g., the representation of base image 610 is rotated 90 degrees), thedelta image data should not subsequently be used to attempt toregenerate the extended image. In one embodiment, identifier 655 may bea hash of all or some portion of original payload portion 645. Inanother embodiment, identifier 655 may be a unique identifier that isstored within original payload portion 645 (rather than in a separatemetadata channel) and may include a format specific marker such as anextra JPEG restart marker that indicates that the data in payloadportion 645 is the original data. Regardless of the specificimplementation of identifier 655, any alteration to payload portion 645would create a mismatch that could be utilized by instructions 660 toabort any subsequent attempt to regenerate the extended image using thepayload portion 645 and metadata channels 650A-C. Instructions 660 mayalso include code that is utilized to reconstruct all or some portion ofthe extended image using some or all of metadata channels 650A-C.

Referring to FIG. 7, instructions 660 of image file 640 may determinewhich version of the image represented by image file 640 is generated.It may first be determined if the data in payload portion 645 matchesidentifier 655 (block 705). In one embodiment, it may be determined if ahash of payload 645 matches a hash of the originally encoded data inpayload portion 645 that is stored in identifier portion 655. In anotherembodiment, it may be determined if payload portion 645 includes aunique identifier that was included with the originally encoded payloadportion 645. If payload portion 645 is consistent with the identifier(the “Yes” prong of block 705), it may then be determined if a devicethat will be used to display the image supports the extended imageformat (block 710). If payload portion 645 does not match the identifier(the “No” prong of block 705) or if the extended image format isunsupported (the “No” prong of block 710), payload portion 645 may bedecompressed (block 715) to generate base image 610. If, however, theextended image version is supported (the “Yes” prong of block 710),payload portion 645 and metadata channels 650A-C may be decompressed(block 720) and the resulting delta image channels may be added to theresulting base image to generate extended image 730. In one embodiment,less than all of metadata channels 650A-C may be decompressed and addedto the base image. In one embodiment, instructions 660 may include codeto define necessary conversions and sequences for generating extendedimage 730 from the base image and some or all of the delta imagechannels. By defining an extended image format that is consistent with abase image format over the range of reference values of the base imageformat and by separating an extended image into a base image anddifference information between the extended image and the base image,the extended image can be packaged into a widely supported image formatthat is backwards compatible with existing devices.

Referring to FIG. 8, a simplified functional block diagram ofillustrative electronic device 800 is shown according to one embodiment.Electronic device 800 may include processor 805, display 810, userinterface 815, graphics hardware 820, device sensors 825 (e.g.,proximity sensor/ambient light sensor, accelerometer and/or gyroscope),microphone 830, audio codec(s) 835, speaker(s) 840, communicationscircuitry 845, digital image capture unit 850, video codec(s) 855,memory 860, storage 865, and communications bus 870. Electronic device800 may be, for example, a digital camera, a personal digital assistant(PDA), personal music player, mobile telephone, server, notebook,laptop, desktop, or tablet computer. More particularly, the disclosedtechniques may be executed on a device that includes some or all of thecomponents of device 800.

Processor 805 may execute instructions necessary to carry out or controlthe operation of many functions performed by device 800. Processor 805may, for instance, drive display 810 and receive user input from userinterface 815. User interface 815 can take a variety of forms, such as abutton, keypad, dial, a click wheel, keyboard, display screen and/or atouch screen. Processor 805 may also, for example, be a system-on-chipsuch as those found in mobile devices and include a dedicated graphicsprocessing unit (GPU). Processor 805 may be based on reducedinstruction-set computer (RISC) or complex instruction-set computer(CISC) architectures or any other suitable architecture and may includeone or more processing cores. Graphics hardware 820 may be specialpurpose computational hardware for processing graphics and/or assistingprocessor 805 to process graphics information. In one embodiment,graphics hardware 820 may include a programmable graphics processingunit (GPU).

Sensor and camera circuitry 850 may capture still and video images thatmay be processed, at least in part, in accordance with the disclosedtechniques by video codec(s) 855 and/or processor 805 and/or graphicshardware 820, and/or a dedicated image processing unit incorporatedwithin circuitry 850. Images so captured may be stored in memory 860and/or storage 865. Memory 860 may include one or more different typesof media used by processor 805 and graphics hardware 820 to performdevice functions. For example, memory 860 may include memory cache,read-only memory (ROM), and/or random access memory (RAM). Storage 865may store media (e.g., audio, image and video files), computer programinstructions or software, preference information, device profileinformation, and any other suitable data. Storage 865 may include one ormore non-transitory storage mediums including, for example, magneticdisks (fixed, floppy, and removable) and tape, optical media such asCD-ROMs and digital video disks (DVDs), and semiconductor memory devicessuch as Electrically Programmable Read-Only Memory (EPROM), andElectrically Erasable Programmable Read-Only Memory (EEPROM). Memory 860and storage 865 may be used to tangibly retain computer programinstructions or code organized into one or more modules and written inany desired computer programming language. When executed by, forexample, processor 805 such computer program code may implement one ormore of the operations described herein.

It is to be understood that the above description is intended to beillustrative, and not restrictive. The material has been presented toenable any person skilled in the art to make and use the inventiveconcepts described herein, and is provided in the context of particularembodiments, variations of which will be readily apparent to thoseskilled in the art (e.g., some of the disclosed embodiments may be usedin combination with each other). Many other embodiments will be apparentto those of skill in the art upon reviewing the above description. Thescope of the invention therefore should be determined with reference tothe appended claims, along with the full scope of equivalents to whichsuch claims are entitled. In the appended claims, the terms “including”and “in which” are used as the plain-English equivalents of therespective terms “comprising” and “wherein.”

The invention claimed is:
 1. A non-transitory program storage device,readable by a processing unit and comprising instructions stored thereonto cause one or more processors to: obtain a first image expressed in afirst image format, each element in the first image defined by one ormore reference values in a first range of values; obtain a second imagethat corresponds to the first image and is expressed in a second imageformat, each element in the second image defined by one or morereference values in a second range of values, wherein the first range ofvalues is a proper subset of the second range of values; obtain a deltaimage based, at least in part, on the first and second images; encodethe first image in a payload portion of an image file; and encode thedelta image in a metadata portion of the image file.
 2. Thenon-transitory program storage device of claim 1, wherein elements inthe second image format represent one or more of a wider color gamut, awider dynamic range, and higher precision than elements in the firstimage format.
 3. The non-transitory program storage device of claim 1,wherein the instructions to cause the one or more processors to obtain adelta image comprise instructions to cause the one or more processors tosubtract the first image from the second image.
 4. The non-transitoryprogram storage device of claim 1, wherein elements in the second imageformat have a higher precision than the first image format.
 5. Thenon-transitory program storage device of claim 1, wherein theinstructions to cause the one or more processors to encode the deltaimage in a metadata portion of the image file comprise instructions tocause the one or more processors to separate the delta image into two ormore delta image channels, each of the two or more delta image channelsrepresenting a property of the second image that is not represented inthe first image.
 6. The non-transitory program storage device of claim5, wherein the instructions to cause the one or more processors toencode the delta image in a metadata portion of the image file furthercomprise instructions to cause the one or more processors to encode eachof the two or more delta image channels in separate metadata portions ofthe image file.
 7. The non-transitory program storage device of claim 6,further comprising instructions to cause the one or more processors toselectively extract one or more of the delta image channels from theimage file before the image file is transmitted or utilized to generatean image display.
 8. The non-transitory program storage device of claim7, wherein the instructions to cause the one or more processors toselectively extract one or more of the delta image channels from theimage file before the image file is transmitted comprise instructions tocause the one or more processors to selectively extract the one or moredelta image channels based, at least in part, on one or more propertiesof a recipient device or a transmission medium to the recipient device.9. The non-transitory program storage device of claim 5, furthercomprising instructions to cause the one or more processors to encodeinstructions to combine the first image and some or all of the two ormore delta image channels in the image file.
 10. A system, comprising:an image capture device; a memory; and a processor operatively coupledto the image capture device and the memory and configured to executeprogram code stored in the memory to: obtain an image with the imagecapture device, generate a first version of the image that is expressedin a first image format, each element in the first version of the imagedefined by one or more reference values in a first range of values,generate a second version of the image that is expressed in a secondimage format, each element in the second version of the image defined byone or more reference values in a second range of values, wherein thefirst range of values is a proper subset of the second range of values,obtain a delta image based, at least in part, on the first and secondimages, encode the first version of the image in a payload portion of animage file, and encode the delta image in a metadata portion of theimage file.
 11. The system of claim 10, wherein the program code tocause the processor to encode the delta image in a metadata portion ofthe image file comprises program code to cause the processor to separatethe delta image into two or more delta image channels, each of the twoor more delta image channels representing a property of the secondversion of the image that is not represented in the first version of theimage.
 12. The system of claim 11, wherein the program code to cause theprocessor to encode the delta image in a metadata portion of the imagefile further comprises program code to cause the processor to encodeeach of the two or more delta image channels in separate metadataportions of the image file.
 13. A non-transitory program storage device,readable by a processor and comprising instructions stored thereon tocause one or more processors to: receive an image file; decode a payloadportion of the image file to generate a first image expressed in a baseimage format, the first image defined by a first plurality of referencevalues in a first range of values; decode a metadata portion of theimage file to generate additional image data; and combine the firstimage and the additional image data to generate a second image expressedin an extended image format, the second image defined by a secondplurality of reference values in a second range of values, wherein thefirst range of values is a proper subset of the second range of values.14. The non-transitory program storage device of claim 13, furthercomprising instructions to cause the one or more processors to determinewhether the payload portion of the image file is consistent with anoriginal payload portion of the image file.
 15. The non-transitoryprogram storage device of claim 14, wherein the instructions to causethe one or more processors to determine whether the payload portion ofthe image file is consistent with an original payload portion of theimage file comprise instructions to cause the one or more processors todetermine whether a hash of all or part of the payload portion matches ahash of all or part of the original payload portion that is stored aspart of the image file.
 16. The non-transitory program storage device ofclaim 14, wherein the instructions to cause the one or more processorsto determine whether the payload portion of the image file is consistentwith an original payload portion of the image file comprise instructionsto cause the one or more processors to determine whether a uniqueidentifier is included in the image file.
 17. The non-transitory programstorage device of claim 16, wherein the unique identifier is a fileformat specific marker that is encoded in the payload portion of theimage file.
 18. The non-transitory program storage device of claim 14,further comprising instructions to cause the one or more processors todecode the metadata portion and to combine the first image and theadditional image data only when it is determined that the payloadportion of the image file is consistent with an original payload portionof the image file.
 19. The non-transitory program storage device ofclaim 18, further comprising instructions to generate a display of thefirst image when it is determined that the payload portion of the imagefile is not consistent with an original payload portion of the imagefile.
 20. A device, comprising: a display element; a memory; and aprocessor operatively coupled to the display element and the memory andconfigured to execute program code stored in the memory to: receive animage file, decode a payload portion of the image file to generate afirst image expressed in a base image format, the first image defined bya first plurality of reference values in a first range of values, decodea metadata portion of the image file to generate additional image data,combine the first image and the additional image data to generate asecond image expressed in an extended image format, the second imagedefined by a second plurality of reference values in a second range ofvalues, wherein the first range of values is a proper subset of thesecond range of values, and display the second image using the displayelement.